Abstract

We present a new proof that the statement 'every subgroup of a free
group is free' implies the Axiom of Choice for finite sets.


1 Introduction 

In 1921, Nielsen [4] proved that every subgroup of a finitely generated free group
is free. This result was generalised to arbitrary free groups by Schreier [5] in 
1927, giving us the following result. 

NS (Nielsen-Schreier): If $F$ is a free group and $K \leq F$ is a subgroup, then $K$ is
a free group.

Since every proof of NS uses the Axiom of Choice, it is natural to ask whether 
it is equivalent to the Axiom of Choice. The first step was made by Läuchli [3],
who showed that NS cannot be proved in ZF set theory with atoms. Jech and
Sochor's embedding theorem [2] allows this result to be transferred to standard
ZF set theory. It was improved in 1985 by Howard [1], who showed that NS
implies $\mathrm{AC}_{\mathrm{fin}}$, the Axiom of Choice for finite sets:

$\mathrm{AC}_{\mathrm{fin}}$ (Axiom of Choice for finite sets): Every set of non-empty finite sets has
a choice function. 

Another Choice principle used in this article is the Axiom of Choice for pairs: 

$\mathrm{AC}_2$ (Axiom of Choice for pairs): Every set of 2-element sets has a choice
function. 

The purpose of this paper is to provide a new and shorter proof of Howard’s 
result. 
2 Nielsen-Schreier implies $\mathrm{AC}_{\mathrm{fin}}$


Before beginning the proof, we must fix some notation and terminology. If $X$
is a set, let $X^- = \{x^{-1} : x \in X\}$ be a set of formal inverses of $X$. It does
not matter what the elements of $X^-$ are, as long as $X^-$ is disjoint from $X$.
Members of $X^\pm = X \cup X^-$ are called $X$-letters. Finite sequences $x_1 \cdots x_n$
with $x_1, \dots, x_n \in X^\pm$ are $X$-words. An $X$-word $x_1 \cdots x_n$ is $X$-reduced if
$x_i \neq x_{i+1}^{-1}$ for $i = 1, \dots, n-1$. If $\alpha$ is an $X$-word, the $X$-reduction of $\alpha$ is the
$X$-reduced $X$-word obtained by performing all possible cancellations within $\alpha$.
For notational simplicity, we don't distinguish between $X$-words and their
$X$-reductions. Reference to $X$ is omitted if $X$ is clear from the context.

If $G$ is a group and $S \subseteq G$, then $\langle S \rangle$ is the subgroup of $G$ generated by $S$.

Definition 1. Let $X$ be a set. The free group on $X$, written $F(X)$, consists of all
reduced $X$-words. The group operation is concatenation followed by reduction,
and the identity is the empty word $1$.

A group $G$ is free if it is isomorphic to $F(X)$ for some $X \subseteq G$. If this is the
case, $X$ is a basis for $G$.
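The group operation of Definition 1 can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the representation of a letter as a pair (name, exponent) and the function names `reduce_word`, `multiply` and `inverse` are hypothetical and do not come from the paper.

```python
from typing import List, Tuple

# A letter is (generator name, exponent), with exponent +1 for x and -1 for x^-1.
# This encoding is an assumption made for illustration.
Letter = Tuple[str, int]
Word = List[Letter]


def reduce_word(word: Word) -> Word:
    """Perform all possible cancellations of adjacent pairs x x^-1 and x^-1 x."""
    stack: Word = []
    for name, exp in word:
        if stack and stack[-1] == (name, -exp):
            stack.pop()  # the incoming letter cancels the previous one
        else:
            stack.append((name, exp))
    return stack


def multiply(u: Word, v: Word) -> Word:
    """Group operation in F(X): concatenation followed by reduction."""
    return reduce_word(u + v)


def inverse(u: Word) -> Word:
    """Inverse of a word: reverse the order and flip every exponent."""
    return [(name, -exp) for name, exp in reversed(u)]
```

The empty list plays the role of the identity $1$, and a single left-to-right pass with a stack is enough because every cancellation happens between neighbouring letters.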

The following proofs will start with a family $Y$ of non-empty sets and construct
a choice function $c : Y \to \bigcup Y$. Without loss of generality, we assume that the
members of $Y$ are pairwise disjoint. We then define $X = \bigcup Y$ to be the basis of
the free group $F = F(X)$. With every $y \in Y$ we associate a function $\sigma_y : F \to \mathbb{Z}$
which counts the number of occurrences of $y$-letters in words $\alpha \in F$ as follows.

Write $\alpha = x_1^{\epsilon_1} \cdots x_n^{\epsilon_n}$ as an $X$-reduced word with $x_1, \dots, x_n \in X$ and
$\epsilon_1, \dots, \epsilon_n \in \{\pm 1\}$. Then define

    $\sigma_y(\alpha) = |\{i : x_i \in y \wedge \epsilon_i = 1\}| - |\{i : x_i \in y \wedge \epsilon_i = -1\}|$.

It is easily checked that, for each $y \in Y$, $\sigma_y$ is a group homomorphism
from the free group $F$ to the additive group of integers.
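The counting function can be sketched as follows, which also makes the homomorphism property visible. The encoding of letters as (name, exponent) pairs and the name `sigma` are assumptions made for illustration. The sketch counts on an arbitrary (not necessarily reduced) word, which gives the same value as counting on its reduction, because a cancelling pair contributes $+1$ and $-1$.

```python
from typing import FrozenSet, List, Tuple

# A letter is (generator name, exponent) with exponent +1 or -1 (illustrative encoding).
Letter = Tuple[str, int]
Word = List[Letter]


def sigma(y: FrozenSet[str], word: Word) -> int:
    """Signed count of y-letters: +1 for each x in y, -1 for each x^-1 with x in y."""
    return sum(exp for name, exp in word if name in y)


y = frozenset({"a", "b"})
u = [("a", 1), ("c", 1), ("b", -1)]  # the word a c b^-1
v = [("b", 1), ("a", 1)]             # the word b a
uv_reduced = [("a", 1), ("c", 1), ("a", 1)]  # reduction of u v: b^-1 b cancels

# Additivity on the concatenation, and agreement with the reduced product.
additive = sigma(y, u + v) == sigma(y, u) + sigma(y, v)
same_after_reduction = sigma(y, u + v) == sigma(y, uv_reduced)
```

Since the count is additive on concatenation and unchanged by cancellation, it respects the group operation of $F$.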

Before proving Theorem 3 we handle a special case in Lemma 2. Its proof serves
as an introduction to ideas used in the proof of the main theorem.

Lemma 2. $\mathrm{ZF} \vdash \mathrm{NS} \Rightarrow \mathrm{AC}_2$.

Proof. Let Y be a family of 2-element sets. Without loss of generality, assume 
that the members of Y are pairwise disjoint. 

Let $X = \bigcup Y$, let $F = F(X)$ be the free group on $X$, and define the subgroup
$K \leq F$ by

    $K = \langle \{ w x^{-1} : (\exists y \in Y)\, w, x \in y \} \rangle$.

By the Nielsen-Schreier theorem, $K$ has a basis $B$. Note that

    $\sigma_y(\alpha) = 0$ for all $y \in Y$ and all $\alpha \in K$.    (1)
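The reason (1) holds can be spelled out in one step from the definitions, using only that the members of $Y$ are pairwise disjoint and that each counting function is a homomorphism. The name $y'$ for the set containing $w$ and $x$ is introduced here purely for illustration.

```latex
% For a generator w x^{-1} of K, with w, x \in y' for some y' \in Y:
\sigma_y(w x^{-1}) = \sigma_y(w) - \sigma_y(x) =
\begin{cases}
  1 - 1 = 0 & \text{if } y' = y, \\
  0 - 0 = 0 & \text{if } y' \neq y \text{ (so that } y' \cap y = \emptyset\text{)}.
\end{cases}
% A homomorphism that vanishes on a generating set of K vanishes on all of K.
```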
We will construct a choice function for $Y$, i.e. a function $c : Y \to X$ satisfying
$c(y) \in y$ for each $y \in Y$.

Let $y \in Y$. Define the function $s_y : y \to y$ to swap the two elements of $y$. For
any choice of $x \in y$, $y = \{x, s_y(x)\}$. To simplify notation, we set $x_i = s_y^i(x)$ for
all $i \in \mathbb{Z}$; hence $y = \{x_0, x_1\}$. Express $x_0 x_1^{-1}$ and $x_1 x_0^{-1}$ as reduced $B$-words:

    $x_0 x_1^{-1} = b_{0,1} \cdots b_{0,l_0}$

    $x_1 x_0^{-1} = b_{1,1} \cdots b_{1,l_1}$,

where $b_{i,j} \in B^\pm$ for all $i, j$. As $x_0 x_1^{-1} = (x_1 x_0^{-1})^{-1}$, it follows that $l_0 = l_1 = l$,
say, and that

    $b_{1,1} = b_{0,l}^{-1}, \; \dots, \; b_{1,l} = b_{0,1}^{-1}$.    (2)


There are two cases: 

(i) $l$ is odd:

Let $m = (l - 1)/2$. The middle $B$-letter of $x_0 x_1^{-1}$ is $b_{0,m+1}$, whereas the
middle $B$-letter of $x_1 x_0^{-1}$ is $b_{1,m+1} = b_{0,m+1}^{-1}$ by (2). One of these two is in
$B$, while the other is in $B^-$. Define $c(y)$ to be the unique element $x \in y$
such that the middle $B$-letter of $x\, s_y(x)^{-1}$ is a member of $B$.

(ii) $l$ is even:

Let $m = l/2$. The following two functions are the key to the proof.

    $f_y : y \to K : x_i \mapsto b_{i,1} \cdots b_{i,m}$
    $g_y : y \to F : x \mapsto f_y(x)^{-1} \cdot x$

The idea of $f_y$ is to map $x_i$ to the 'first half' of $x_i x_{i+1}^{-1}$ in terms of the new
basis $B$. $f_y(x)$ is intended to represent $x$ in $K$.

Using (2), we obtain

    $f_y(x_i) f_y(x_{i+1})^{-1} = b_{i,1} \cdots b_{i,m} \cdot b_{i+1,m}^{-1} \cdots b_{i+1,1}^{-1}$
                                $= b_{i,1} \cdots b_{i,m} \cdot b_{i,m+1} \cdots b_{i,l}$    (3)
                                $= x_i x_{i+1}^{-1}$.

It follows that $g_y(x_0) = g_y(x_1)$. Hence the image of $y$ under $g_y$ has a single
member, $\alpha_y$, say. Note that

    $\sigma_y(\alpha_y) = \sigma_y(g_y(x_0))$
                        $= \sigma_y(f_y(x_0)^{-1} x_0)$    (4)
                        $= \sigma_y(f_y(x_0)^{-1}) + \sigma_y(x_0)$
                        $= 0 + 1$    using (1), $f_y(x_0) \in K$, and $x_0 \in y$

is non-zero. This means that $\alpha_y$ mentions at least one $y$-letter. So we
define $c(y)$ to be the $y$-letter which appears first in the $X$-reduction of $\alpha_y$.
□
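As a sanity check on case (i), consider the smallest possible instance, assuming that $Y$ consists of a single pair $\{a, b\}$. This toy family is an illustration and not part of the original argument. Here $K$ is infinite cyclic, and the element chosen depends only on which of the two bases of $K$ is supplied by NS.

```latex
% Toy instance: Y = \{\{a, b\}\}, X = \{a, b\}, y = \{a, b\}.
K = \langle a b^{-1} \rangle \cong \mathbb{Z},
\qquad \text{with exactly two bases: } B = \{a b^{-1}\} \text{ or } B = \{b a^{-1}\}.
% In either case l = 1 is odd, so case (i) applies with m = 0.
B = \{a b^{-1}\}: \quad a\, s_y(a)^{-1} = a b^{-1} \in B \;\Longrightarrow\; c(y) = a,
\\
B = \{b a^{-1}\}: \quad b\, s_y(b)^{-1} = b a^{-1} \in B \;\Longrightarrow\; c(y) = b.
```

In this instance the basis is what breaks the symmetry between the two elements of the pair.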


We are now ready to prove the general case: 

Theorem 3. $\mathrm{ZF} \vdash \mathrm{NS} \Rightarrow \mathrm{AC}_{\mathrm{fin}}$.

Proof. Let Z be a family of non-empty finite sets. Without loss of generality, 
assume that the members of Z are pairwise disjoint. We form a new family 

    $Y = \{y : y \neq \emptyset \wedge (\exists z \in Z)\, y \subseteq z\}$,

i.e. the closure of $Z$ under taking non-empty subsets. Since $Z \subseteq Y$, any choice
function for Y immediately gives a choice function for Z. 
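For a finite family $Z$, the closure $Y$ can be computed directly. The following is a minimal sketch, assuming the members of $Z$ are given as Python frozensets; the function name `subset_closure` is hypothetical.

```python
from itertools import combinations
from typing import FrozenSet, Hashable, Iterable, Set


def subset_closure(Z: Iterable[FrozenSet[Hashable]]) -> Set[FrozenSet[Hashable]]:
    """Return all non-empty subsets of members of Z."""
    Y: Set[FrozenSet[Hashable]] = set()
    for z in Z:
        elems = list(z)
        # Subsets of every size from 1 up to |z|, so z itself is included.
        for r in range(1, len(elems) + 1):
            for combo in combinations(elems, r):
                Y.add(frozenset(combo))
    return Y


Z = [frozenset({1, 2, 3}), frozenset({4, 5})]
Y = subset_closure(Z)
```

Every member of `Z` appears in `Y`, so restricting a choice function on `Y` to `Z` gives a choice function on `Z`.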

Let $X = \bigcup Y$, let $F = F(X)$ be the free group on $X$, and let $K \leq F$ be the
subgroup defined by

    $K = \langle \{ w x^{-1} : (\exists y \in Y)\, w, x \in y \} \rangle$.

By the Nielsen-Schreier theorem, $K$ has a basis $B$.

For each $n < \omega$, let $Y^{(n)} = \{y \in Y : |y| = n\}$ and $Y^{(\leq n)} = \{y \in Y : |y| \leq n\}$. By
induction on $n$, we will find a choice function $c_n$ on $Y^{(\leq n)}$ for each $2 \leq n < \omega$.
By construction, the $c_n$ will be nested, so that $\bigcup_{2 \leq n < \omega} c_n$ is a choice function
for $Y$.

A choice function $c_2$ on $Y^{(\leq 2)}$ is guaranteed by Lemma 2.

Assume that $n \geq 3$ and that there is a choice function $c_{n-1}$ for $Y^{(\leq n-1)}$. For
every $y \in Y^{(n)}$ we define a function $s_y$ by

    $s_y : y \to y : x \mapsto c_{n-1}(y \setminus \{x\})$.

Note that, as $Y$ is closed under taking non-empty subsets, $y \setminus \{x\} \in Y^{(\leq n-1)}$, so
$c_{n-1}(y \setminus \{x\})$ is defined. There are four cases:

(i) $s_y$ is not a bijection:

In this case, $|\{s_y(x) : x \in y\}| \leq n - 1$, so defining

    $c_n(y) = c_{n-1}(\{s_y(x) : x \in y\})$

gives a choice for $y$.

(ii) $s_y$ is a bijection with at least two orbits¹:

Since there are at least two orbits, each orbit has size $\leq n - 1$. Moreover,
as $s_y(x) \neq x$ for all $x \in y$, the number of orbits is also $\leq n - 1$. So choosing
one point from each orbit, and then choosing one point from among the
chosen points gives a single element of $y$. More specifically, if we write
$\mathrm{orb}(x)$ for the orbit of $x \in y$ under $s_y$, we define

    $c_n(y) = c_{n-1}(\{c_{n-1}(\mathrm{orb}(x)) : x \in y\})$.

¹ Thanks to Thomas Forster for suggesting a simplification of this part of the proof.
(iii) $s_y$ is a bijection with one orbit, and $n$ is even:

If $n$ is even, $s_y^2$ is a bijection with two orbits. Remembering that we are
assuming $n \geq 3$, this gives us $\leq n - 1$ orbits of size $\leq n - 1$ each. A choice
is made as in the previous case, with $s_y^2$ in place of $s_y$.

(iv) $s_y$ is a bijection with one orbit, and $n$ is odd:

Notice that, for any $x \in y$, $y = \{x, s_y(x), s_y^2(x), \dots, s_y^{n-1}(x)\}$. $s_y(x)$ may be
viewed as the successor of $x$. For simplicity, we set $x_i = s_y^i(x)$ for $i \in \mathbb{Z}$,
so that $y = \{x_0, x_1, \dots, x_{n-1}\}$.

In order to further simplify our notation, we shall assume that the elements
of $Y^{(n)}$ are pairwise disjoint. Of course, this is not possible when $Y$ is
constructed as above. But replacing every $y \in Y$ with $y \times \{y\}$ makes no
difference to the argument, so the proof carries over without any changes.

Recall the basis $B$ of the subgroup $K$ defined earlier in the proof. We may
write

    $x_0 x_1^{-1} = b_{0,1} \cdots b_{0,l_0}$
    $x_1 x_2^{-1} = b_{1,1} \cdots b_{1,l_1}$
    $\vdots$
    $x_{n-1} x_0^{-1} = b_{n-1,1} \cdots b_{n-1,l_{n-1}}$

as reduced $B$-words, with $b_{i,j} \in B^\pm$ for all $i, j$. First, we make two
simplifications:

(a) If it is not the case that $l_0 = \dots = l_{n-1}$, let $l = \min\{l_i : i = 0, \dots, n-1\}$.
Then $\{x_i : l_i = l\}$ is a proper non-empty subset of $y$, and we define

    $c_n(y) = c_{n-1}(\{x_i : l_i = l\})$.

From now on it is assumed that $l_0 = \dots = l_{n-1} = l$, say.

(b) Note that

    $(x_0 x_1^{-1})(x_1 x_2^{-1}) \cdots (x_{n-1} x_0^{-1}) = 1$,

i.e.

    $(b_{0,1} \cdots b_{0,l})(b_{1,1} \cdots b_{1,l}) \cdots (b_{n-1,1} \cdots b_{n-1,l}) = 1$.    (5)

For $i = 0, \dots, n-1$, let $k_i$ be the number of $B$-cancellations in

    $(b_{i,1} \cdots b_{i,l})(b_{i+1,1} \cdots b_{i+1,l})$.    (6)

If it is not the case that $k_0 = \dots = k_{n-1}$, let $k = \min\{k_i : i = 0, \dots, n-1\}$.
Then $\{x_i : k_i = k\}$ is a proper non-empty subset of $y$,
and we define

    $c_n(y) = c_{n-1}(\{x_i : k_i = k\})$.

From now on it is assumed that $k_0 = \dots = k_{n-1} = k$, say.
As letters always cancel in pairs, (5) implies that $nl$ is even. Since we
are assuming that $n$ is odd, it follows that $l$ is even. Define $m = l/2$, and
note that $k \geq m$: if not, then complete cancellation in (5) would not be
possible. This allows us to define functions $f_y$ and $g_y$, as in the proof of
Lemma 2:

    $f_y : y \to K : x_i \mapsto b_{i,1} \cdots b_{i,m}$
    $g_y : y \to F : x \mapsto f_y(x)^{-1} x$.

Since there are $k \geq m$ cancellations in (6), we have
$b_{i+1,1} = b_{i,l}^{-1}, \; \dots, \; b_{i+1,m} = b_{i,l-m+1}^{-1} = b_{i,m+1}^{-1}$ for all $i$.
By the same calculation as in (3), it follows that

    $f_y(x_i) f_y(x_{i+1})^{-1} = x_i x_{i+1}^{-1}$

for all $i$, and hence that $g_y(x_i) = g_y(x_{i+1})$ for all $i$. So $g_y : y \to F$ is a
constant function, taking a single value $\alpha_y$, say. The same calculation as
(4) yields

    $\sigma_y(\alpha_y) = 1$.

So we set $c_n(y)$ to be the first $y$-letter occurring in the $X$-reduction of $\alpha_y$.

□ 
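The combinatorial part of the induction step, cases (i) to (iii), can be sketched in Python for finite data. This is an illustration under stated assumptions, not the proof itself: `c_prev` stands in for the choice function on sets of size at most $n - 1$, the names `orbits` and `extend_choice` are hypothetical, and case (iv), which needs the basis $B$ of $K$, is delegated to a caller-supplied placeholder `cyclic_case`.

```python
from typing import Callable, Dict, FrozenSet, Hashable, List

Elem = Hashable
Choice = Callable[[FrozenSet[Elem]], Elem]


def orbits(s: Dict[Elem, Elem]) -> List[FrozenSet[Elem]]:
    """Orbits of a bijection s, given as a dict on a finite set."""
    seen = set()
    result = []
    for x in s:
        if x in seen:
            continue
        orbit = []
        z = x
        while z not in seen:  # follow s until the cycle closes
            seen.add(z)
            orbit.append(z)
            z = s[z]
        result.append(frozenset(orbit))
    return result


def extend_choice(y: FrozenSet[Elem], c_prev: Choice, cyclic_case: Callable) -> Elem:
    """Choose an element of y, given a choice function on smaller sets."""
    n = len(y)
    # s_y(x) = c_{n-1}(y \ {x})
    s = {x: c_prev(y - {x}) for x in y}
    image = frozenset(s.values())
    if len(image) < n:
        # Case (i): s_y is not a bijection, so its image is a smaller set.
        return c_prev(image)
    orbs = orbits(s)
    if len(orbs) == 1 and n % 2 == 0:
        # Case (iii): a single orbit of even length splits in two under s_y squared.
        orbs = orbits({x: s[s[x]] for x in y})
    if len(orbs) >= 2:
        # Cases (ii) and (iii): choose within each orbit, then among the chosen points.
        return c_prev(frozenset(c_prev(o) for o in orbs))
    # Case (iv): one orbit and n odd; this step needs the basis B (not modelled here).
    return cyclic_case(y, s)
```

For sets of integers, `min` can play the role of `c_prev` in a quick experiment; with `y = frozenset({1, 2, 3})` the map is not a bijection, so case (i) applies.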

Whether or not the Nielsen-Schreier theorem is equivalent to the Axiom of 
Choice still remains an open question. A positive answer might be obtainable 
by adapting the proof of Theorem 3. Finiteness of the sets was used to define
the choice function recursively, splitting into cases (i) - (iv). Cases (i) - (iii)
were easily dealt with. Case (iv) gave us a cyclic ordering on the finite set,
which is enough structure to use the basis of the subgroup $K$ to choose a single element.